Conditional Decision Processes with Recursive Function
نویسندگان
چکیده
منابع مشابه
Reachability in Recursive Markov Decision Processes
We consider a class of infinite-state Markov decision processes generated by stateless pushdown automata. This class corresponds to 112 -player games over graphs generated by BPA systems or (equivalently) 1-exit recursive state machines. An extended reachability objective is specified by two sets S and T of safe and terminal stack configurations, where the membership to S and T depends just on ...
متن کاملRecursive Kernel Estimation of the Conditional Intensity of Nonstationary Point Processes
This paper develops adaptive nonparametric methods for analyzing seismic data. Kernel smoothing techniques are suitable for space-time point processes; however, they must be adapted to deal with the nonstationarity of earthquakes. By this we mean changes in the spatial and temporal pattern of point occurrences. A class of recursive kernel density and regression estimators are proposed to study ...
متن کاملMarkov Decision Processes with Arbitrary Reward Processes
We consider a learning problem where the decision maker interacts with a standard Markov decision process, with the exception that the reward functions vary arbitrarily over time. We show that, against every possible realization of the reward process, the agent can perform as well—in hindsight—as every stationary policy. This generalizes the classical no-regret result for repeated games. Specif...
متن کاملUnbiased Recursive Partitioning: A Conditional Inference Framework
Recursive binary partitioning is a popular tool for regression analysis. Two fundamental problems of exhaustive search procedures usually applied to fit such models have been known for a long time: Overfitting and a selection bias towards covariates with many possible splits or missing values. While pruning procedures are able to solve the overfitting problem, the variable selection bias still ...
متن کاملConditional Autoregressive Hilbertian processes
When considering the problem of forecasting a continuous-time stochastic process over an entire time-interval in terms of its recent past, the notion of Autoregressive Hilbert space processes (arh) arises. This model can be seen as a generalization of the classical autoregressive processes to Hilbert space valued random variables. Its estimation presents several challenges that were addressed b...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Mathematical Analysis and Applications
سال: 1999
ISSN: 0022-247X
DOI: 10.1006/jmaa.1998.6192